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ABSTRACT 



Elemental abundance patterns can provide vital clues to the formation and enrichment history 
of a stellar population. Here we present an investigation of the Galactic bulge, where we apply 
principal component abundance analysis (PCAA) — a principal component decomposition of relative 
abundances [X/Fe] — to a sample of 35 microlensed bulge dwarf and subgiant stars, characterizing 
their distribution in the 12-dimensional space defined by their measured elemental abundances. The 
first principal component PCI, which suffices to describe the abundance patterns of most stars in 
the sample, shows a strong contribution from a -elements, reflecting the relative contributions of 
Type II and Type la supernovae. The second principal component PC2 is characterized by a Na-Ni 
correlation, the likely product of metallicity-dependent Type II supernova yields. The distribution 
in PC 1 is bimodal, showing that the bimodality previously found in the [Fe/H] values of these stars 
is robustly and independently recovered by looking at only their relative abundance patterns. The 
two metal-rich stars that are oc -enhanced have outlier values of PC2 and PC3, respectively, further 
evidence that they have distinctive enrichment histories. Applying PCAA to a sample of local thin and 
thick disk dwarfs yields a nearly identical PCI; in PCI, the metal-rich and metal-poor bulge dwarfs 
track kinematically selected thin and thick disk dwarfs, respectively, suggesting broadly similar oc- 
enrichment histories. However, the disk PC2 is dominated by a Y-Ba correlation, likely indicating 
a contribution of s-process enrichment from long-lived asymptotic giant branch stars that is absent 
from the bulge PC2 because of its rapid formation. 

Key words: Galaxy: general — Galaxy: bulge — Galaxy: evolution — Galaxy: formation — 
Galaxy: stellar content — stars: abundances 
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1. Introduction 

Elemental abundance trends of Galactic bulge stars are crucial for understand- 
ing bulge formation. Galactic chemical evolution is best traced by dwarf stars be- 
cause their spectra are straightforward to analyze (Edvardsson et al. 1993). How- 
ever, observations of bulge dwarfs are challenging due to their faintness (V = 19-20; 
Feltzing and Gilmore 2000), impeding spectroscopic observation under normal cir- 
cumstances. Consequently, many studies have focused on giant stars, despite the 
difficulty in analyzing their spectra. This difficulty has led to shifts in the mean 
[Fe/H] and the [Fe/H] distribution function of the bulge as the analysis techniques 
are refined (e.g., Rich 1988, McWilliam and Rich 1994, Fulbright et al. 2007, Zoc- 
cali et al. 2008, Hill et al. 2011). Fortunately, gravitational microlensing offers a 
unique opportunity to observe bulge dwarfs. When a bulge dwarf is lensed by a 
foreground object, its brightness can increase by > 5 magnitudes, enabling spec- 
troscopic observations of sufficiently high resolution and signal-to-noise (S/N) for 
an abundance analysis {e.g., Minniti et al. 1998, Johnson et al. 2007, Bensby et al. 
201 1 and references therein). 

The abundances of different elements are correlated, reflecting their origin in 
a common nucleosynthetic process, such as Type II or Type la supernovae (SNe). 
However, these correlations are not perfect because the same element can be made 
in multiple nucleosynthetic processes, whose relative contributions to a star are 
best distinguished if abundances are measured for many elements. Here we an- 
alyze a sample of 35 bulge dwarf^], all of which have at least seven measured 
elemental abundances and the majority (24/35) of which have 11 or 12 measured 
elemental abundances, with median errors a < 0.25 dex. Bensby et al. (2011) 
find that the bulge dwarf [Fe/H] distribution function is bimodal, peaked at [Fe/H] 
« —0.6 and +0.3. They further show that the a -element abundances in the bulge 
dwarfs vary systematically with [Fe/H], following the trends found for thin and 
thick disk dwarfs (Bensby et al. 2003, 2005, Reddy et al. 2003, 2006). Here we re- 
visit this data set with a different analysis technique based on principal component 
decomposition of the elemental abundance patterns, showing that the bimodality 
seen in [Fe/H] also appears in the relative elemental abundance patterns. 

Principal component analysis (PCA) is a natural tool for characterizing cor- 
relations in a high-dimensional space, reducing the overall dimensionality of the 
data set while allowing the data themselves to reveal the strongest patterns of cor- 
relations. While PCA has a long history in astronomy, its application to elemental 
abundance analysis is relatively new. The only such application we know of is the 
study of Ting et al. (2012), who used this technique to investigate the distributions 
in elemental abundance space (hereafter, C-space; Freeman and Bland-Hawthorn 
2002) defined by various samples of stars from the disk, halo, clusters, and satellite 

'Although our sample includes some subgiant stars, we will describe it as "bulge dwarfs" for 
brevity. 
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galaxies. They found that disk stars occupied about 6 dimensions within the 17- 
dimensional C -space of the data, but the nucleosynthetic processes likely respon- 
sible for the lowest order dimensions changed as a function of [Fe/H]. Their work 
demonstrated the potential usefulness of principal component abundance analysis 
(PCAA) as a way to identify groups of stars with distinct enrichment histories. 
Here we apply this approach to bulge dwarfs to shed further light on the formation 
of the Galactic bulge. 

2. Method 

PCAA defines a new set of orthogonal basis vectors in C -space whose compo- 
nents are chosen to align with the maximum variation within the data not already at- 
tributed to lower order components. We use standard PCA (see, e.g., Jolliffe 1986) 
with the data matrix { d,j } representing the logarithmic abundance^ relative to iron 
of element j for star i: d t j = [Xj/Fe] with Xj = O, Na, Mg, Al, Si, Ca, Ti, Cr, 
Ni, Zn, Y, and Ba for j = 1, 2, 12. PCA identifies orthogonal eigenvectors 
£k = {e/cj} such that the abundance of a given element in a given star can be repre- 
sented as a sum 

djj =dj + J^Ci, k e k j, (1) 
k=i 

where dj = ^— dij is the mean value of [Xj/Fe] in the full data set and c,-^ 
" tar 1=1 

is the coefficient for the k PC of star i. The first PC describes the direction in 
elemental abundance space along which the sample stars exhibit the greatest vari- 
ation, the second PC describes the direction of the second largest variation, etc. If 
the number of principal components in the sum is equal to the number of elements 
measured, then the data can be represented exactly. However, if elemental abun- 
dances are correlated so that stars are restricted to a lower dimensional subspace, 
then the elemental abundances can be represented to good accuracy by a smaller 
number of PCs. 

Our data set comes from the homogeneous elemental abundance and error re- 
analysis of microlensed bulge dwarfs and subgiants by Bensby et al. (in prep.); 
abundances for 26 of our 35 sample stars have been previously published (Epstein 
et al. 2010, Bensby et al. 2010, 2011). 

3. Results 

The dimensionality of the subspace occupied by stars within the full C-space 
can be expressed as the number of PCs required to explain the intrinsic variation 



2 Elemental abundances are defined as [X/K] = log^x/AV) — log^x/AV)© , with missing data 
replaced by the average value of that elemental abundance for the other stars. 
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in the data (i.e., the variation not attributable to observational errors). Ting et al. 
(2012) used Monte Carlo simulations spanning a range of intrinsic dimensionality 
and variance to show that the true dimensionality was recovered when the cumu- 
lative variation of the first k PCs was about 85%. For our sample of microlensed 
bulge dwarfs, the first 1, 2, and 3 PCs describe 64%, 77%, and 84% of the cumu- 
lative variation within the data. Although the threshold for completely describing 
the data likely depends on many factors including sample size, number of observed 
abundances, and abundance uncertainties, it is fair to say that the bulge dwarfs oc- 
cupy approximately three dimensions of the 12-dimensional C-space investigated 
here. 



i.o, — , — , — , — , — , — , — , — , — , — , — , — , 1 1.0, 
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Fig. 1. A graphical representation of the PCI (panel a) and PC2 (panel b) vectors of the microlensed 
bulge dwarfs (circles/solid lines) and the disk dwarf sample of Bensby et al. (2003, 2005, and in 
prep.) (triangles/dotted lines). The PCI and PC2 eigenvector components refer to e\ j and e2j, 
respectively, from Eq. (1), with uncertainties from bootstrap resampling. We also show PC2 for the 
bulge dwarfs if MOA-2009-BLG-259S is omitted from the sample (crosses/dashed line in panel b); 
PCI remains nearly unchanged, so it is not plotted for clarity. Each symbol represents an abundance 
([XfFs]) and thus a dimension in the original C-space. 

Fig. 1 shows the first two PCs derived from the observed bulge dwarf elemen- 
tal abundances, with the uncertainties determined from bootstrap resampling (see 
below). PCI is dominated by the abundances of oxygen, other a -elements (Mg, 
Si, Ca, and Ti), and Al, with small uncertainties on the relative contributions of 
each abundance. SNe II are the primary source of a -elements and Al, but they 
are also a significant producer of Fe, especially at early times. By contrast, SNe 
la create large amounts of Fe and other Fe-peak elements, leading to sub-solar 
[oc/Fe] yields; once enough time has elapsed for a substantial number of SNe la to 
occur, they become the dominant source of Fe. The interplay between these two 
nucleosynthetic sources is thought to underpin the dichotomy in [oc/Fe] observed 
in Galactic disk stars, with high [oc/Fe] for "thick disk" stars reflecting rapid for- 
mation (e.g., Fuhrmann 1998) and roughly solar [oc/Fe] for "thin disk" stars, which 
have predominantly higher [Fe/H] (Gilmore and Wyse 1985). Although PCA is 
a "blind" statistical technique with no a priori theoretical input, in this data set 
(and others we have explored) the first principal component picks up this expected 
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distinction between the two dominant supernova enrichment mechanisms. 

PC2 is primarily governed by Na with secondary contributions from Ni (cor- 
related with Na) and Ba (anticorrelated with Na). A similar Na-Ni correlation, 
attributed to metallicity-dependent SN II yields, has been found previously in halo 
stars (Nissen and Schuster 1997, 2010); however, this study and the companion pa- 
per of Bensby et al. (in prep.) are the first to identify a Na-Ni correlation amongst 
bulge stars (see Bensby et al. in prep, for more details). Na is primarily produced 
by hydrostatic carbon burning in the massive stars that explode as SNe II, but pro- 
ton capture at the same temperatures depletes the pre-explosion Na abundance. As 
metallicity increases, the neutron excess increases, making Na less susceptible to 
proton capture and consequently increasing the overall Na yield (Clayton 2003). 
Similarly, the yield of 58 Ni (the most common Ni isotope) from SNe II is sen- 
sitive to the neutron excess and the abundance of neutron-rich nuclei, like 23 Na, 
in the progenitor star (Woosley et al. 1973). Additional significant Ni produc- 
tion occurs in SNe la, whose 58 Ni yield increases with metallicity (Timmes et al. 
2003). However, the Na-Ba anticorrelation is not readily explained by a single 
nucleosynthetic process. One star with distinctive abundances, MOA-2009-BLG- 
259S (see Fig. 3c), has a large impact on the contributions of Zn, Y, and Ba to PC2 
and drives up the uncertainties for these abundances. The crosses/dashed line in 
Fig. lb show the effect of omitting this star when defining principal components; 
PCI hardly changes, but the contributions of Zn and Y to PC2 switch from being 
correlated with Na to anticorrelated. The [Y/Fe] for MOA-2009-BLG-259S has a 
large uncertainty, though the elevated [Zn/Fe] = 0.44 ±0.17 appears to be well- 
established (Fig. 3c). Regardless of MOA-2009-BLG-259S, PC2 is dominated by 
Na and shows a Na-Ni correlation and a Na-Ba anticorrelation. 

For comparison, we have found the principal components of a sample of 702 
solar neighborhood thin and thick disk dwarfs from Bensby et al. (2003, 2005, and 
in prep.) with the same elements measured. The disk PCI and PC2 are shown as 
triangles/dotted lines in Fig. 1. The clear similarity between the bulge and disk 
PC Is suggests that the relative enrichment from SNe II vs. SNe la is the main 
driver of diversity among stars in both samples. On the other hand, the disk and 
bulge PC2s do not resemble each other: in contrast to the bulge PC2 discussed 
above, the disk PC2 is dominated entirely by correlated Y and Ba, both neutron- 
capture elements produced mainly by the s -process in disk stars (Sneden et al. 
2008). The short vs. long lifetimes of the nucleosynthetic sources driving the bulge 
and disk PC2s, respectively, implies that the bulge formed too rapidly (<Gyr) for 
asymptotic giant branch stars to dominate PC2. Johnson et al. (2012) independently 
reached a similar conclusion based on the relative abundances of r- and s -process 
elements. 

Fig. 2 shows the distribution of the 35 bulge dwarfs in (PCI, PC2)-space, with 
the upper panel showing the histograms of the bulge dwarfs in PCI and of kine- 
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Fig. 2. Panel (a): the histograms of bulge dwarfs (gray) and of kinematically selected thin (red) and 
thick (blue) disk dwarfs projected onto the bulge PCI axis defined above. Panel (b): the distribution 
of bulge dwarfs in (PCI, PC2)-space, color-coded by [Fe/H]. The square and triangle represent the 
PC2 outlier (MOA-2009-BLG-259S) and the PC 3 outlier (MOA-2010-BLG-523S), respectively. The 
positions along PCI and PC2 correspond to c, i and qi. respectively, in Eq. (1). 



matically selected subsets of thin and thick disk dwarf^] projected onto the bulge 
PCI. It is evident from visual inspection that the bulge dwarfs divide into two 
distinct groups along the PCI axis, centered at PCI values of ±0.3. (Because 
the sum in Eq. 1 includes the mean elemental abundances of the sample, a value 
of PCI = —0.3 corresponds to ~[a/Fe] Q .) Thus, this analysis of relative ele- 
mental abundances, with no direct input from [Fe/H], recovers the bimodality that 
Bensby et al. (201 1) found in the [Fe/H] distribution without reference to relative 
abundances. Bensby et al. (2011) did find elevated [a/Fe] ratios for the metal- 
poor bulge dwarfs, and we recover the same correlation in this "reverse" analysis: 
every star with PCI < —0.1 has [Fe/H] > —0.02, and all but two of the stars 
with PCI > -0.1 have [Fe/H] < -0.18. Of these two stars, one (MOA-2009- 
BLG-259S; square) is a clear PC2 outlier, while the other (MOA-2010-BLG-523S; 
triangle) is a moderate PC3 outlier that is undistinguished in (PCI, PC2)-space. 
Fig. 2a shows that the metal-rich and metal-poor bulge dwarfs track the thin 



The kinematic designation is based on the Bensby et al. (2003) selection criteria; however, we 
adopt a more stringent cut on the relative thick/thin disk membership probability: stars with P (thick 
disk)/ /'(thin disk) > 100 and P(thick disk)/P(thin disk) < 0.01 are designated thick and thin disk 
stars, respectively. 
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and thick disk dwarfs, respectively, in PCI. PCAA highlights the scarcity of stars 
with intermediate [oc/Fe] (PCI ~ 0) in the bulge dwarfs and in the thin and thick 
disk dwarfs. The fact that the two bulge populations track the two disk populations 
in [Fe/H] and PCI suggests that the stars had a -enrichment histories that produced 
the same abundance patterns. This similarity provides tentative evidence that the 
bulge formed through secular processes, such as disk instabilities, that heated in- 
ner thin and thick disk stars to form the bulge (Kormendy and Kennicutt 2004). 
Alternatively, the bulge and disk could have formed by distinct mechanisms but ex- 
perienced "convergent" enrichment histories. For example, Bournaud et al. (2007) 
propose that the large cosmological accretion rates of high redshift galaxies enable 
the rapid (^0.5-1 Gyr), simultaneous formation of the bulge and inner disk, which 
could account for the a -enhanced subpopulation of each component. 

To test the statistical significance of PC 1 and PC2 and the robustness of the bi- 
modality in Fig. 2, we created 100 bootstrap resamplings of the data set, redefining 
principal components each time. PCI is always recovered, and the general form 
of PC2 (high Na value, Na-Ni correlation, and Na-Ba anticorrelation) is recov- 
ered in 99/100 resamplings. Thus, PC2 is statistically significant even though the 
contributions of Zn, Y, and Ba to PC2 fluctuate because of its sensitivity to MOA- 
2009-BLG-259S. The histogram of PCI values is always multimodal, showing two 
distinct groupings (as in Fig. 2a) about 90% of the time; the remaining resamplings 
show three apparent groupings, but the small sample size makes the distribution 
of values within the PCI > group difficult to characterize reliably. The signifi- 
cance of a formal test for bimodality will depend on the form of the adopted null 
hypothesis, but the likelihood ratio of a 2-Gaussian fit to the PCI distribution to a 
1-Gaussian fit is 5£ (2G)/J£ (1G) = 2 x 10 5 , a large improvement for the addition 
of two free parameters. 

Fig. 3 shows the decompositions of the elemental abundance patterns of four 
sample stars into the sum of the sample mean and the first one, two, or three PCs. 
The best fit coefficients c,-^ of Eq. (1) are found for each star by % 2 -minimization, 
treating all errors as independent. Panels (a) and (b) show typical examples of 
metal-poor and metal-rich stars, respectively, each of them fit almost perfectly with 
a single PC. Panel (c) shows MOA-2009-BLG-259S, which is poorly fit by PCI 
alone but well fit when PC2 is also included. Panel (d) shows MOA-2010-BLG- 
523S, which has one of the largest PC3 coefficients in our sample. 

Clearly these PCA fits have low values of % 2 ed = % 2 /(d.o.f.), once a sufficient 
number of PCs (one, two, or three) is included. The number of degrees of freedom 
is the number of elemental abundances measured for the star minus the number of 
PCs in the fit. Panel (e) shows the histogram of % 2 ed values for single-PC and 2-PC 
decompositions; median values are % 2 ed =0.31 and 0.21, respectively. The low 
% 2 ed values indicate that the observational errors on the elemental abundances are 
typically overestimated, at least in the sense that they do not represent the variance 
in estimated elemental abundance that would arise from observing the same star 
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many times. We believe that these low %^ ed values arise because the quoted errors 
effectively incorporate a component of systematic calibration error in addition to 
random error. For example, multiple lines of the same element may give discrepant 
abundance estimates because of uncertainties in the assumed oscillator strengths; 
the internal dispersion of these estimates is one useful indication of the absolute 
elemental abundance uncertainty, but the range of elemental abundance measure- 
ments from multiple high-S/N spectra will be smaller than this dispersion because 
the same oscillator strengths are assumed each time. The treatment of random and 
systematic errors and error correlations is a significant issue for PCAA and other 
model-fitting approaches, but we defer it to future investigations. 

4. Conclusions 

Our results confirm and extend the findings of Bensby et al. (201 1), who identi- 
fied bimodality in the [Fe/H] distribution of microlensed bulge dwarfs, with metal- 
poor stars showing enhanced a -element abundances like those of solar neighbor- 
hood thick disk stars. Our principal component analysis demonstrates that the bi- 
modality found by Bensby et al. (2011) is present even in the relative elemental 
abundances ([XfFe]) of the bulge dwarfs alone. The first PC is dominated by oc- 
elements, reflecting the dichotomy between SN II and SN la enrichment. The sec- 
ond PC captures the Na-Ni correlation caused by the metallicity dependence of SN 
II yields. Intriguingly, the two metal -rich stars that exhibit a -enhancement (MOA- 
2009-BLG-259S and MOA-2010-BLG-523S) are also outliers from the main locus 
of stars in (PCI, PC2, PC3)-space, suggesting that they do indeed have unusual en- 
richment histories. If we project the thin and thick disk dwarfs onto the bulge dwarf 
PCI, they occupy the locations of the metal-rich and metal-poor bulge dwarfs, re- 
spectively. Analyzing local disk dwarfs, we find that the first principal component 
is nearly identical to the bulge PCI. However, the disk PC2 is governed by Y and 
Ba, products of long-lived asymptotic giant branch stars, whereas enrichment from 
short-lived SNe II drives the bulge PC2. Qualitatively, these results support a sce- 
nario in which the bulge grows by secular evolution from the inner disk, which 
itself has the elemental abundance dichotomy seen in local populations, but there 
may be other bulge formation models that can produce similar results. 

Our results, and those of Ting et al. (2012), illustrate the potential of PCAA as 
a tool for characterizing the distribution of stars in high- dimensional C -space. One 
application, highlighted here, is to identify subpopulations in a sample, drawing 
on the information present in all measured elemental abundances simultaneously. 
With large multi-element samples, this approach could be used to isolate "inter- 
loper" stars accreted from a dissolved satellite, and perhaps to identify cohorts of 
stars associated with common birth clusters (Freeman and Bland-Hawthorn 2002). 
A second application, illustrated by the examples of MOA-2009-BLG-259S and 
MOA-2010-BLG-523S, is to identify outlier stars, either through their unusual lo- 



Vol.0 



9 



cations in PC-space or because they are poorly fit by combinations of PCs that 
fit most stars well. Such outliers may reveal rare but physically informative en- 
richment pathways. A third application, highlighted by Ting et al. (2012), is to 
characterize the dimensionality of the stellar distribution in C-space, which is a 
basic test for models of Galactic assembly and enrichment. We will explore this 
technique in future work using theoretical models. 

By allowing the data themselves to define the directions of strongest variation, 
PCAA complements the usual approach of testing predictive models that adopt nu- 
cleosynthetic yields from theoretical calculations. The PCs clearly do have physi- 
cal content, but the connection of PCs to enrichment mechanisms is not one-to-one 
(see Ting et al. 2012), and interpreting them will require comparisons to mod- 
els that vary both enrichment and mixing histories and the nucleosynthetic yields 
themselves. Our analysis identifies several practical complications of PCAA, in- 
cluding the potential sensitivity to outliers in small samples, the treatment of miss- 
ing elemental abundance measurements, the impact of heteroscedastic errors and 
correlated errors, and the mix of random and systematic contributions to the errors 
quoted in observational analyses. We will investigate these issues in future work. 
The enormous samples of high-resolution spectra anticipated from the SDSS-III 
APOGEE survey (Majewski et al. in prep., Eisenstein et al. 2011), the Gaia-ESO 
survey (Gilmore et al. 2012), and the HERMES survey (Barden et al. 2010) will 
map the elemental abundance distribution over wide swaths of the Galaxy, and 
PCAA will be a valuable tool for connecting these measurements to a comprehen- 
sive theory of the formation of the Milky Way. 
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Fig. 3. Panels (a)-(d) show elemental abundances for, respectively, a metal-poor star, a metal-rich 
star, the PC2 outlier star (MOA-2009-BLG-259S), and the PC3 outlier star (MOA-2010-BLG-523S). 
Lines indicate the best fit combined abundance pattern using the mean abundance and the first PC 
(solid gray line), the first two PCs (short dashed line), and the first three PCs (long dashed line in 
panel d only). Reduced- % 2 (Xred) va l ues f° r these fits are listed in the legends. Panel (e) shows the 
distribution of y£ eA values for the bulge dwarf sample, with the same line types as panels (a)-(c). 



